rank | frequency | n-gram |
---|---|---|
1 | 8748 | -ς |
2 | 4253 | -ν |
3 | 3881 | -α |
4 | 3061 | -ι |
5 | 2328 | -ε |
rank | frequency | n-gram |
---|---|---|
1 | 1363 | -ας |
2 | 1311 | -ου |
3 | 1249 | -ης |
4 | 1195 | -ει |
5 | 1187 | -ες |
rank | frequency | n-gram |
---|---|---|
1 | 804 | -ται |
2 | 721 | -ουν |
3 | 553 | -ική |
4 | 469 | -ικό |
5 | 439 | -ικά |
rank | frequency | n-gram |
---|---|---|
1 | 345 | -εται |
2 | 345 | -ουμε |
3 | 342 | -νται |
4 | 296 | -ικής |
5 | 285 | -ντας |
rank | frequency | n-gram |
---|---|---|
1 | 251 | -ονται |
2 | 196 | -οντας |
3 | 139 | -θηκαν |
4 | 127 | -τικές |
5 | 126 | -τικών |
The tables show the most frequent letter-N-grams at the ending of words for N=1…5. Everything runs in parallel to 2.2.5 Most frequent word beginnings. The aim is suffix detection instead of affix detection.
For N=3:
SELECT @pos:=(@pos+1), xx.* from (SELECT @pos:=0) r, (select count(*) as cnt ,concat("-", right(word,3)) FROM words WHERE w_id>100 group by right(word,3) order by cnt desc) xx limit 5;
2.2.5 Most frequent word beginnings